Logical Methods in Computer Science 

Vol. 7 (3:14) 2011, pp. 1-24 Submitted Nov. 17, 2010 

www.lmcs-online.org Published Sep. 21,2011 



RANDOMISATION AND DERANDOMISATION IN DESCRIPTIVE 

COMPLEXITY THEORY 



KORD EICKMEYER AND MARTIN GROHE 

Humboldt-Universitat zu Berlin, Institut fiir Informatik, Logik in der Informatik 

Unter den Linden 6, 10099 Berlin, Germany 

e-mail address: eickmeye@inforniatik.hu-berlin.de, grohe@informatik.hu-berlin.de 



Abstract. We study probabilistic complexity classes and questions of derandomisation 
from a logical point of view. For each logic L we introduce a new logic BPL, bounded 
error probabilistic L, which is defined from L in a similar way as the complexity class BPP, 
bounded error probabilistic polynomial time, is defined from P. 

Our main focus lies on questions of derandomisation, and we prove that there is a 
query which is definable in BPFO, the probabilistic version of first-order logic, but not 
in CJ^jj, finite variable infinitary logic with counting. This implies that many of the 
standard logics of finite model theory, like transitive closure logic and fixed-point logic, 
both with and without counting, cannot be derandomised. Similarly, we present a query 
on ordered structures which is definable in BPFO but not in monadic second-order logic, 
and a query on additive structures which is definable in BPFO but not in FO. The latter 
of these queries shows that certain uniform variants of AC' (bounded-depth polynomial 
sized circuits) cannot be derandomised. These results are in contrast to the general belief 
that most standard complexity classes can be derandomised. 

Finally, we note that BPIFP+C, the probabilistic version of fixed-point logic with count- 
ing, captures the complexity class BPP, even on unordered structures. 



1. Introduction 

The relation between different modes of computation — deterministic, nondeterministic, 
randomised — is a central topic of computational complexity theory. The P vs. NP problem 
falls under this topic, and so does a second very important problem, the relation between 
randomised and deterministic polynomial time. In technical terms, this is the question 
of whether P = BPP, where BPP is the class of all problems that can be solved by a 
randomised polynomial time algorithm with two-sided errors and bounded error probability. 
This question differs from the question of whether P = N P in that most complexity theorists 
seem to believe that the classes P and BPP are indeed equal. This belief is supported by 
deep results due to Nisan and Wigderson [31] and Impagliazzo and Wigderson j20j . which 
link the derandomisation question to the existence of one-way functions and to circuit lower 
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bounds; cf. also [21]. Similar derandoniisation questions are studied for other complexity 
classes such as logarithmic space, and it is believed that derandoniisation is possible for 
these classes as well. 

Descriptive complexity theory gives logical descriptions of complexity classes and thus 
enables us to translate complexity theoretic questions into the realm of logic. While logical 
descriptions are known for most natural deterministic and nondeterministic time and space 
complexity classes, probabilistic classes such as BPP have received very little attention in 
descriptive complexity theory yet. In this paper, we study probabilistic complexity classes 
and questions of derandoniisation from a logical point of view. For each logic L we introduce 
a new logic BPL, bounded error probabilistic L, which is defined from L in a similar way as 
BPP is defined from P. The randomness is introduced to the logic by letting formulas of 
vocabulary r speak about random expansions of r-structures to a richer vocabulary r U /?. 
We also introduce variants RL, CO-RL with one-sided bounded error and PL with unbounded 
error, corresponding to other well known complexity classes. 

Our main technical results are concerned with questions of derandoniisation. By this we 
mean upper bounds on the expressive power of randomised logics in terms of classical logics. 
Trivially, BPL is at least as expressive as L, and if the two logics are equally expressive, 
then we say that BPL derandomisable. More generally, if L is a (deterministic) logic that 
is at least as expressive as BPL, then we say that BPL derandomisable within L . We prove 
that BPFO, bounded error probabilistic first-order logic, is not derandomisable within C^^, 
finite variable infinitary logic with counting. This implies that many of the standard logics 
of finite model theory, like transitive closure logic and fixed-point logic, both with and 
without counting, cannot be derandomised. Note that these results are in contrast to the 
general belief that most standard complexity classes can be derandomised. 

We then investigate whether BPFO can be derandomised on classes of structures with 
built-in relations, such as ordered structures and arithmetic structures. We prove that 
BPFO cannot be derandomised within MSO, monadic second-order logic, on structures 
with built-in order. Furthermore, BPFO cannot be derandomised on structures with built- 
in order and addition. Interestingly and nontrivially, BPFO can be derandomised within 
MSO on structures with built-in order and addition. Behle and Lange [5] showed that the 
expressive power of FO on classes of ordered structures with certain predefined relation 
symbols corresponds to uniform subclasses of AC , the class of problems decidable by cir- 
cuit families of bounded depth, unbounded fan-in and polynomial size. In fact, for any 
set TZ of built-in relations they show that F0[7?.] captures FO[7?.]-uniform AC . Arguably 
the most intensively studied uniformity condition on AC is dlogtime-uniform AC , which 
corresponds to F0[-|-, x], first-order logic with built-in arithmetic (Barrington et al. [3]). 
The question of whether dlogtime-uniform BPAC can be derandomised is still open, but 
there is a conditional derandomisation by Viola j39]. There are less uniform variants of 
BPAC that can be proved to be derandomisable by standard arguments; cf. [T]. We prove 
that the more uniform FO[-|-]-uniform AC is not derandomisable. This raises the question 
of how weak uniformity must be for derandomisation to be possible. 

In the last section of this paper, we turn to more standard questions of descriptive com- 
plexity theory. We prove that BPIFP-i-C, the probabilistic version of fixed-point logic with 
counting, captures the complexity class BPP, even on unordered structures. For ordered 
structures, this result is a direct consequence of the Immerman-Vardi Theorem |18(l38j. and 
for arbitrary structures it follows from the observation that we can define a random order 
with high probability in BPIFP-i-C. Still, the result is surprising at first sight because of its 
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similarity with the open question of whether there is a logic capturing P, and because it is 
believed that P = BPP. The caveat is that the logic BPIFP+C does not have an effective 
syntax and thus is not a "logic" according to Gurevich's [16] definition underlying the ques- 
tion for a logic that captures P. Nevertheless, we believe that BPIFP+C gives a completely 
adequate description of the complexity class BPP, because the definition of BPP is inher- 
ently ineffective as well (as opposed to the definition of P in terms of the decidable set of 
polynomially clocked Turing machines). We obtain similar descriptions of other probabilis- 
tic complexity classes. For example, randomised logspace is captured by the randomised 
version of deterministic transitive closure logic with counting. 

Related work 

As mentioned earlier, probabilistic complexity classes such as BPP have received very little 
attention in descriptive complexity theory. There is an unpublished paper due to Kaye [22] 
that gives a logical characterisation of BPP on ordered structures. Miiller |30j and Mon- 
toya (unpublished) study a logical BP-operator in the context of parameterised complexity 
theory. What comes closest to our work "in spirit" and also in some technical aspects is 
Hella, Kolaitis, and Luosto's work on almost everywhere equivalence [IT], which may be 
viewed as a logical account of average case complexity in a similar sense that our work 
gives a logical account of randomised complexity. There is a another logical approach to 
computational complexity, known as implicit computational complexity, which is quite dif- 
ferent from descriptive complexity theory. Mitchell, Mitchell, and Scedrov |28j give a logical 
characterisation of BPP by a higher-order typed programming language in this context. 

Let us emphasise that the main purpose of this paper is not the definition of new 
probabilistic logics, but an investigation of these logics in a complexity theoretic context. 

2. Preliminaries 

2.1. Structures and Queries 

A vocabulary is a finite set r of relation symbols of fixed arities. A t- structure A consists 
of a finite set V{A), the universe of the structure, and, for all R £ t, a relation R{A) on 
A whose arity matches that of R. Thus we only consider finite and relational structures. 
Let a, T be vocabularies with a Q t. Then the a -restriction of a r-structure B is the a- 
structure B\a- with universe V{B\cj) := V{B) and relations R{B\(j) := R{B) for all R £ a. 
A T-expansion of a a-structure A is a r-structure B such that B\u = A. For every class C of 
structures, C[t] denotes the class of all r-structures in C. A renaming of a vocabulary r is 
a bijective mapping r from r to a vocabulary r' such that for all i? G r the relation symbol 
r{R) £ r' has the same arity as R. If r : r — ?■ r' is a renaming and ^4 is a r-structure then 
A"" is the r'-structure with V{A'') := V{A) and r{R){A'') := R{A) for ah i? G r. 

We let ^, + and x be distinguished relation symbols of arity two, three and three, 
respectively. Whenever any of these relations symbols appear in a vocabulary r, we demand 
that they be interpreted by a linear order and ternary addition and multiplication relations, 
respectively, in all r-structures. To be precise, let [a, 6] be the set {a^a + 1,...,6} for 



a < 6 G N, and denote by J\fn the {^, +, x}-structure with 

V[Mn) = [0,n - 1], ^(AA„) = {(a, 6) \a^h} and 

+(A/'n) = {{a,h,c) \a + h = c}, x{Mn) = {{a,b,c) \a-b = c}. 

We demand A|{^^^ x}nr — (■^yl|)l{^,+,x}nT ^^^ ^^^ r-structures A. We call structures whose 
vocabulary contains any of these relation symbols ordered, additive and multiplicative, re- 
spectively. We say that a formula ^{x) with exactly one free variable x defines an element 
if in every structure it is satisfied by exactly one element. Since we may identify the ele- 
ments of an ordered structure uniquely with natural numbers it makes sense to say, e.g., 
that 'V(^) defines a prime number" or 'VC^) defines a number < log^ ' 1^1" i ^-i^d we will 
sometimes do so. 

On ordered structures, every fixed natural number i can be defined in first-order logic 
by a formula (/?i-th using only three variables as follows: 

^o-th{x) ■■=yyx <y 

V'in+iythix) ■■= ^y^z (v3„,-th(y) a -^{x=y) Ay <xA 

{{y < z Az <x) ^ {y=z V y=z))). 

Because the ordering may be defined using the addition relation, the same holds true on 
additive structures, again using only three variables. 

A k-ary T-global relation is a mapping 7?. that associates a fc-ary relation TZ{A) with 
each r-structure A. A 0-ary r-global relation is usually called a Boolean r-global relation. 
We identify the two 0-ary relations and {()}, where () denotes the empty tuple, with the 
truth values false and true, respectively, and we identify the Boolean r-global relation TZ 
with the class of all r-structures A with TZ{A) = true. A k-ary r-query is a fc-ary r-global 
relation Q preserved under isomorphism, that is, if / is an isomorphism from a r-structure 
A to a r-structure B then for all a £ V{A)'' it holds that a G Q{A) ^^ f{a) £ Q{B). 

2.2. Logics 

A logic L has a syntax that assigns a set L[r] of L-formulas of vocabulary r with each 
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vocabulary r and a semantics that associates a r-global relation Q^ with every formula 
<p £ L[r] such that for all vocabularies cr,T,T' the following three conditions are satisfied: 

(1) For all ip G L[r] the global relation Q^p is a r-query. 

(2) If o" C r then L[a] C L[r], and for all formulas ip G \-[a] and all r-structures A it holds 
thatQ^^''\A\^) = Q'j^\A). 

(3) If r : r — )• r' is a renaming, then for every formula ip G L[r] there is a formula if"^ G L[r'] 
such that for aU r-structures A it holds that Q^'^'(A) = Q^r'\A''). 

Condition (ii) justifies dropping the vocabulary r in the notation for the queries and just 
write QJt,. For a r-structure A and a tuple a whose length matches the arity of Q\\, we 
usually write A \=]_ ip[a] instead of a G QL{A). If Q\\ is a fe-ary query, then we call ip a 
k-ary formula, and if Qz, is Boolean, then we call ip a sentence. Instead of A |=l y^[{)] we 
just write A \=i ip and say that A satisfies ip. We omit the index L if L is clear from the 
context. 

A query Q is definable in a logic L if there is an L- formula ip such that Q = Q^. Two 
formulas ipi,ip2 G L[r] are equivalent (we write ifi = 992) if they define the same query. We 



say that a logic Li is weaker than a logic L2 (we write Li ^ L2) if every query definable in 
Li is also definable in L2. Similarly, we define it for Li and L2 to be equivalent (we write 
Li = L2) and for Li to be strictly weaker than L2 (we write Li ^ L2). The logics Li and L2 
are incomparable if neither Li ^ L2 nor L2 ^ Li. 

Remark 2.1. Our notion of logic is very minimalistic, usually logics are required to meet 
additional conditions (see [8] for a thorough discussion). In particular, we do not require 
the syntax of a logic to be effective. Indeed, the main logics studied in this paper have an 
undecidable syntax. Our definition is in the tradition of abstract model theory (cf. H]); 
proof theorists tend to have a different view on what constitutes a logic. 

We assume that the reader has heard of the standard logics studied in finite model 
theory, specifically first- order logic FO, second-order logic SO and its fragments S]., monadic 
second-order logic MSO, transitive closure logic TC and its deterministic variant DTC, least, 
inflationary, and partial fixed-point logic LFP, IFP, and PFP, and finite variable infinitary 
logic L^^. For all these logics except LFP there are also counting versions, which we denote 
by FO+C, TC+C, . . ., PFP+C and C^^, respectively. Only familiarity with first-order logic 
is required to follow most of the technical arguments in this paper. The other logics are 
more or less treated as "black boxes". We will say a bit more about some of them when 
they occur later. The following diagram shows how the logics compare in expressive power: 

FO < DTC < TC < LFP = IFP < PFP < L^^ 
< < < < < < (2.1) 

FC?+C < DTC+C < T(?+C < IFP+C < PFP+C < c£^. 

Furthermore, MSO is strictly stronger than FO and incomparable with all other logics 
displayed in (|2.ip . 



2.3. Complexity theory 

We assume that the reader is familiar with the basics of computational complexity theory 
and in particular the standard complexity classes such as P and NP. Let us briefly review the 
class BPP, bounded error probabilistic polynomial time, and other probabilistic complexity 
classes: A language L C S* is in BPP if there is a polynomial time algorithm M, expecting 
as input a string x G S* and a string r G {0, 1}* of "random bits", and a polynomial p such 
that for every x £ T,* the following two conditions are satisfied: 

(i) If X G L, then Pr^gio.ijpCI^I) {M accepts (x,r)) > |. 

(ii) If X ^ L, then Prre{o,i}p(l^l) (^ accepts {x,r)) < i. 

In both conditions, the probabilities range over strings r G {0, 1}p(I^I) chosen uniformly at 
random. The choice of the error bounds 1/3 and 2/3 in (i) and (ii) is somewhat arbitrary, 
they can be replaced by any constants a, j3 with < a < /3 < 1 without changing the 
complexity class. (To reduce the error probability of an algorithm we simply repeat it 
several times with independently chosen random bits r.) 

Hence BPP is the class of all problems that can be solved by a randomised polynomial 
time algorithm with bounded error probabilities. RP is the class of all problems that can 
be solved by a randomised polynomial time algorithm with bounded one-sided error on 
the positive side (the bound 1/3 in (ii) is replaced by 0), and CO-RP is the class of all 
problems that can be solved by a randomised polynomial time algorithm with bounded 
one-sided error on the negative side (the bound 2/3 in (i) is replaced by 1). Finally, PP 
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is the class we obtain if we replace the lower bound > 2/3 in (i) by > 1/2 and the upper 
bound < 1/3 in (ii) by < 1/2. Note that PP is not a realistic model of "efficient randomised 
computation" , because there is no easy way of deciding whether an algorithm accepts or 
rejects its input. Indeed, by Toda's Theorem [37], the class P contains the full polynomial 
hierarchy. By the Sipser-Gacs Theorem (see [23]), BPP is contained in the second level of 
the polynomial hierarchy. More precisely, BPP C S2 H 112. ^^ ^^ ^^ open question whether 
BPP C NP. However, as pointed out in the introduction, there are good reasons to believe 
that BPP = P. 

2.4. Descriptive complexity 

It is common in descriptive complexity theory to view complexity classes as classes of 
Boolean queries, rather than classes of formal languages. This allows it to compare logics 
with complexity classes. The translation between queries and languages is carried out as 
follows: Let r be a vocabulary, and assume that ^ r. With each ordered (r U {^})- 
structure B we can associate a binary string s{B) G {0, 1}* in a canonical way. Then with 
each class C C ©[rUJ^}] of ordered r structures we associate the language L{C) := {s{B) \ 
B eC} C {0, 1}*. For a Boolean r-query Q, let Q^ := {B E 0[t U ^] | B\r G Q} be the 
class of all ordered (r U {^})-expansions of structures in Q. We say that Q is decidable in a 
complexity class K if the language L(Q^) is contained in K. We say that a logic L captures 
K if for all Boolean queries Q it holds that Q is definable in L if and only if Q is decidable 
in K. We say that L is contained in K if all Boolean queries definable in L are decidable in 
K. 

Remark 2.2. Just like our notion of "logic", our notion of a logic "capturing" a complexity 
class is very minimalistic, but completely sufficient for our purposes. For a deeper discussion 
of logics capturing complexity classes we refer the reader to one of the textbooks [O] (El (IHl 



3. Randomised logics 

Throughout this section, let r and p be disjoint vocabularies. Relations over p will be 
"random" , and we will reserve the letter R for relation symbols from p. We are interested 
in random (r U p)-expansions of r-structures. For a r-structure A, by X(A,p) we denote 
the class of all (r U /9)-expansions of A. We view X(A, p) as a probability space with the 
uniform distribution. Note that we can "construct" a random X G X{A^p) by deciding 
independently for all /c-ary R ^ p and all tuples a G V{A)^ with probability 1/2 whether 
a G R{X). Hence if p = {Ri, . . . , Rk}, where Ri is r^-ary, then a random X G X{A, p) can 
be described by random bitstring of length X]i=i '^''S where n := \V{A)\. We are mainly 
interested in the probabilities 

Pr (X ^ ^) 

xeXiA,p) 

that a random (r U /9)-expansion of a r-structure A satisfies a sentence if of vocabulary rUp 
of some logic. 

Definition 3.1. Let L be a logic and < a < /5 < 1. 



(1) A formula ip G L[rUp] that defines a /c-ary query has an (a, pi\-gap if for all r-structures 
A and all a G V{A)^ it holds that 

V IJ. S^ ^ '^["l) < a or Pr (X ^ (^[a]) > /3. 

XeA'(yl,p) X(^X{A,p) 

(2) The logic P^^^^iL is defined as follows: For each vocabulary r, 

P{«,/3]LH :=IJ{'^ ^ '-[rUp] I 93 has an (a,/3]-gap}, 
p 
where the union ranges over all vocabularies p disjoint from r. To define the semantics, 
let if G P(q /3]L[t]. Let k, p such that ip G L[rU/9] and ip is A;-ary. Then for all r-structures 
A, 

Q^<-^'^(^) := {a G y(^)^ | ^^^^^J^ |=L V^[a]) > /3}. 

It is easy to see that for every logic L and all a,j3 with < a < /3 < 1 the logic P(q,^^iL 
satisfies conditions (i)-(iii) from Subsection 12.21 and hence is indeed a well-defined logic. We 
let 

PL := P(i/2,i/2]L and RL := P(o,2/3]L and BPL := P(i/3^2/3]L. 
We can also define a logic P[q^^)L and let CO-RL := P[i/3 i)L. The following lemma, which is 
an adaptation of classical probability amplification techniques to randomised logics, shows 
that for reasonable L the strength of the logic P((j^^]L does not depend on the exact choice 
of the parameters q, /3. This justifies the arbitrary choice of the constants 1/3, 2/3 in the 
definitions of RL and BPL. 

Lemma 3.2. Let L be a logic that is closed under conjunctions and disjunctions. Then for 
all a,/3 with 0<a</3<l it holds that P(o,/3]L = RL and P/^^f^iL = BPL. 

Proof. Let r an /) = {Ri, . . . ,Rk} be disjoint relational vocabularies and let ip G L[r U p]. 
For any n > 1 we define a new vocabulary 

p{n) --{R^p I 1 <i<n,l <i< A;}, 
where the arity of R- is that of Rj G p. Using the renaming property with the renaming 

that leaves r fixed and maps Rj G p to R^ we get sentences <p^^\ which are the sentence 
ip with every occurrence of Rj replaced by R^ . Since L is closed under conjunctions and 
disjunctions, for every < / < n there is an L[r U p'"'] -sentence 

/C[n] J€/ 

which is satisfied iff at least I of the ip^^' are satisfied. Notice that the ip^"^' use distinct 
random relations, so they are satisfied independently of each other. 

Clearly, if Pr(X |= c/?) = then also Pr(X |= c^'"' )) = 0, because we assumed / > 1. 
On the other hand, if Pr(X |= (/?) > /3 for some /3 G (0, 1), then 

Pr(X 1= v^^"'^)) = 1 - (1 - Pr(X 1= ip)Y (3.1) 

>l-(l-/3r, (3.2) 
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and this bound can be made arbitrarily close to 1 by choosing n sufficiently large. This 
proves the claim about RL. 

For BPL, notice that if if has an (a,/3]-gap for some any < a < /3 < 1, then for any 
< q' < /?' < 1 there is an n G N such that 



has an (a',/3']-gap. In fact, the Chernoff bound (see, e.g., [29]) gives very sharp estimates 
on n in terms of a, /3, a' and /3', though we only need the mere existence of such an n 
here. D 



3.1. First observations 

We start by observing that the syntax of BPFO and thus of most other logics BPL is unde- 
cidable. This follows easily from Trakhtenbrot's Theorem (see [9] for similar undecidability 
proofs): 

Observation 3.3. For all a, f3 with < a < /3 < 1 and all vocabularies r containing at 
least one at least binary relation symbol, the set BP/Q,j3iF0[r] is undecidable. 

Proof Sketch. Assume for some < a < P < 1 and some r containing a binary relation 
symbol E the set BP(Q,j3]F0[r] is decidable. 

By Trakhtenbrot's Theorem (cf. [9l Thm. 7.2.1]), the satisfiability of a first-order 
formula ip £ FO[t] on finite graphs is undecidable. Let Q be the class of all graphs with 
exactly one isolated vertex, and let ipg be a sentence defining Q on finite structures. By 
standard arguments, whether a formula is satisfiable in Q or on is undecidable. 

Let p = a ■ 2~ G (a, /?) with o S N be a dyadic rational in the interval {a, /3), and let 
Ri, . . . ,Rk be unary random relations. For every S C [k], the sentence 

V's := 3x (yy^Exy) A f\ RiX A /\ ^RiC 

has satisfaction probability 2"'^ in all structures in Q. Thus for a family S = {Si, . . . , Sa} 
of a distinct subsets of [k] , the sentence 

"05 := V V's 
SeS 

is satisfied with probability p on such structures. But now the sentence 

V'g -^ (x A ^5) 
is in BP(cf,/3]F0[T] if and only if x is not satisfiable on ^. D 

For each n, let Sn be the 0-structure with universe V{Sn) '■= {I,--- jn}. Recall the 
0-1-law for first order logic |12t ll4|. In our terminology, it says that for each vocabulary p 
and each sentence if G FO[p] it holds that 

lim Pr {X^^)e{0,l} 

(in particular, this limit exists). There is also an appropriate asymptotic law for formulas 
with free variables. This implies that on structures with empty vocabulary, PFO (and in 



particular BPFO) has the same expressive power as FO. As there is also a 0-1-law for the 
logic L^^ [23j, we actually get the following stronger statement: 

Observation 3.4. Every formula ip G PL^ojI®] i^ equivalent to a formula (p' € FO[0]. 

As FO+C is strictly stronger than FO even on structures of empty vocabulary, this obser- 
vation implies that there are queries definable in FO+C, but not in (B)PL^^. 

Furthermore, the Sipser-Gacs Theorem j24j that BPP C Sj n Ilg, the fact that the 
fragment Yj\ of second-order logic captures S2 |1H [36], and the observation that BPFO ^ 
BPP imply the following: 

Observation 3.5. BPFO ^ S|. 

We will use Lautemann's proof of the Sipser-Gacs Theorem in section [5] in the context 
of monadic second-order logic. 

We close this section by observing that randomised logics without probability gaps are 
considerably more powerful than their non-randomised counterparts: 

Observation 3.6. Let /C be a class of finite structures such that there is a first-order 
formula ^Pc{x) defining a single element in each structure of fC. Then every S}-query on /C 
can be defined in PFO. 

Proof. Let 99 be a Sj-query on /C, i.e., ip is of the form 3Xi • • • 3X^-0, where the Xi are 
relation variables and ■0 is first-order. We replace each of the Xi by a random relation Ri 
of the same arity to get a new sentence ip' and introduce an extra unary random relation 
Rq. Then (p is equivalent to the PFO-sentence 

3x{Rox A ^ci^)) V p)' , 

because the first part is satisfied with probability exactly 1/2. □ 

Toda's Theorem [37] that the polynomial hierarchy is contained in P suggests that. 



in fact, every second-order query is definable in PFO. However, Toda's proof does not carry 
over easily to the PFO-case. Observation 13.41 suggests that some technical condition such 
as definability of an element of the structure is necessary to separate PFO from FO at all. 
One example of such a class /C is the class of all ordered structures, with ipc{x) defining the 
minimum element. 

4. Separation results for BPFO 

In this section we study the expressive power of the randomised logics RFO, CO-RFO, and 
BPFO. Our main results are the following: 

• RFO is not contained in C^^^ 

• BPFO is not contained in MSO on ordered structures 

• RFO is stronger than FO on additive structures 

A forteriori, the first and the third result also hold with BPFO instead of RFO, and the 
constructions used in their proofs are also definable in CO-RFO. 

It turns out that we need three rather different queries to get these separation results. 
For the first two queries this is obvious, because every query on ordered structures is 
definable in C^^. The third query (on additive structures) is readily seen to be definable 
in MSO. In fact, in Section [5] we show the following: 



• Any BPFO-definable query on additive structures can be defined in MSO. 

4.1. RFO is not contained in C^^^ 

Formulas of the logic C^^^ may contain arbitrary (not necessarily finite) conjunctions and 
disjunctions, but only finitely many variables, and counting quantifiers of the form 3-"'x (p 
("there exists at least n x such that ip"). For example, the class of finite structures of even 
cardinality can be defined in this logic by the sentence 



V 

*;>0 



i>2k 



XX=X A 



]>2fc+l 



xx=x 



Theorem 4.1 

hut not in C 



There is a class TCFX of structures that is definable in RFO and CO-RFO, 



oow 



Recall that by Observation 13.41 there also is a class of structures definable in FO+C < 
C^^, but not in BPFO. 

Our proof of Theorem 14.11 is based on a well-known construction due to Cai, Fiirer, 
and Immerman [6], who gave an example of a Boolean query in P that is not definable in 
C(X)W We modify their construction in a way reminiscent to a proof by Dawar, Hella, and 
Kolaitis [7] for results on implicit definability in first-order logic, and obtain a query TCJ-X 
definable in (CO-)RFO, but not in CJ^^. Just like in Cai, Fiirer and Immerman's original 
proof, the reason why C|^^ can not define our query TCJ-X is its inability to choose one out 
of a pair of two elements. Using a random binary relation this can - with high probability 
- be done in FO. 

We first review the construction of and then show how to modify it to suit our needs. 
Given a graph G = {V^E), Cai et al. construct a new graph C, replacing all vertices and 
edges of G with certain gadgets. We shall call graphs G' resulting in this fashion CFI-graphs, 
and will from now on restrict ourselves to connected 3-regular graphs G and CFI-graphs 
resulting from these. 



vertex 





edge 



a I 
a' 



H 



')b 



Y Straight 

^twisted , : : •' 

a''il^:)b 



a2 B2 



Figure 1: The gadgets for CFI-graphs. Dashed ellipses indicate groups of equivalent ver- 
tices. Vertex labels are not part of the actual structure. 

The construction is as follows: For each vertex in G, we place a copy of the gadget 
shown on the left of Figure [T] in G' . It has a group of four nodes (henceforth called centre 
nodes) plus three pairs of nodes, which are to be thought of as ends of the three edges 
incident with that node. For the time being, we think of the pairs as ordered from 1 to 
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3 and distinguish between the two nodes in each pair, say one of them is the a-node, the 
other one being the b node. Each of the four centre nodes is connected to one node from 
each pair, and each of them to an even number of a's. To illustrate this, the centre nodes 
are labelled with the even subsets of {1,2,3}. We also introduce an equivalence relation 
(or colouring, if you like) of nodes as shown in Figure [H so any isomorphism of the gadget 
necessarily permutes nodes within each edge group and the centre group. 




23 ) 



Figure 2: The CFI-graph construction for a part of a graph. Edge and nodes labels are not 
part of the actual graph. 



For each edge in G, we connect the a- and 6-nodes in the corresponding pairs as shown 
on the right of Figure [TJ We say an edge is "twisted" if the a-node of one pair is connected 
to the 6- node of the other and vice versa. This completes our construction of G'. For 
definiteness, when we speak of an edge group we mean an equivalence class of size two, 
and by a centre group we mean one of size four. An edget is a pair of edge groups which 
form an edge gadget as on the right of Figure [TJ Figure [2] shows the result of applying this 
construction to a small subgraph (a vertex with its three neighbours). 

Without the a- and 6-labels, we cannot decide which of the edges have been twisted. 
In fact there are only two isomorphism classes of CFI-graphs derived from G, namely those 
with an even number of edges twisted and those with an odd number (we call the latter 
ones twisted CFI-graphs). This relies on the fact that isomorphisms of the gadget on the 
left of Figure [1] are exactly those permutations swapping an even number of a's and Vs. 
Since we assume G to be connected, we can twist edges along a path between two nodes 
adjacent to twisted edges, reducing the number of twisted edges by two; cf. [H Lemma 6.2] 
for details. 

By [HI Thm. 6.4], if the original graph G has no separator of size at most s then the two 
isomorphism classes of CFI graphs derived from it can not be distinguished by a sentence 
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(p E C^^, i.e., by a C^^^ sentence with at most s distinct variables. In P, on the other 
hand, twisted CFI-graphs can easily be recognised: Choose exactly one node from each 
edge group and label this one a and the other one h. A centre node is connected to an even 
number of a's if and only if all four nodes in its centre group are. In this case we call the 
centre group even, otherwise we call it odd. Then a CFI-graph is twisted if and only if 

(number of odd centre groups + number of twisted edgets) is odd. 

We aim for a (co-)RFO-sentence which defines exactly the twisted connected 3-regular 
CFI-graphs. In view of the above P-algorithm, we are done if we can 

• express connectedness of the graph, 

• count edgets and centre groups modulo two and 

• choose one representative from each centre group, edge group and edget. 

For counting modulo two and to get representatives for centre groups and edgets, we 
augment the structures with a Boolean algebra in the following way: Let r be the vocabulary 
{E, ~, <, C, P, O}, with unary P and O, and binary E, ~, < and 1^. Let CJ-X be the class 
of structures A such that 

• E defines a 3-regular, connected CFI-graph on V{A) \ P{A), 

• (P(A), C) is a Boolean algebra !B, and O is true exactly for its members of even cardinality 

• < defines a linear order on the set of atoms of !B (and no other element of A is <-related 
to any other). 

• ~ defines an equivalence relation, where each equivalence class 

— contains one atom of 03 and the nodes of one edget 

— or contains one atom of !B and the nodes of one centre group 

— or consists of a single non-atom of !B. 

In particular, the number of atoms of the Boolean algebra *B is equal to the number of 
edgets plus the number of centre groups. Note also that we can distinguish the two edge 
groups in an edget because only nodes in the same edge group are connected to nodes in 
the same centre group. 

Theorem 4.2. The class CJ-X is definable in FO. The subclass TCJ-X of twisted CFI- 
graphs is definable in BPFO but not in C^^. 

Proof. That CJ-X is definable is easy to establish, the only subtlety being that 53 allows us 
to quantify over sets of centre groups, which makes connectedness expressible. 

The proof that TCJ-X is not definable in C^^ is the same as in [6] ; it is unaffected by the 
additional structure. Note that because the atoms are ordered, the Boolean algebra is rigid, 
i.e., it has no non-trivial automorphism, therefore the isomorphism group of a CFI-graph 
is not changed by adding the Boolean algebra. 

It remains to show that twistedness can be defined in BPFO. We pick one vertex from 
each edge group by viewing a random binary relation R as assigning an m-bit number to 
each vertex, where m is the number of atoms in the Boolean algebra. From each pair, we 
choose the vertex with the smaller number, expressed by 

^(x) := 3y( X ~ y A 3z(a(z) A -iRxz A Ryz A yw{w < z — )■ {Rxw o Ryw))^ j , 

where a{x) is an FO-formula satisfied exactly by the atoms of the Boolean algebra. It is 
easy to see that if the random relation R assigns a different set of atoms to the two vertices 
in each edge group, then ^ succeeds in picking exactly one vertex from each edge group, and 
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boolean algebra ^^^^^^^^ ^^^^^^^ ^^^ 



' M = P^ 



""- -" N = A\P^ 

Figure 3: The structures in B contain a Boolean algebra and a perfectly matched set. 

twistedness can then be checked by looking at the O-predicate of the element of *B which 
contains exactly the atoms equivalent to twisted centre groups or twisted edgets. 

To prove that the resulting formula has a large probability gap, we need to establish a 
high probability of success only for structures in the class CTX, because this class is FO- 
definable. But in such structures, the probability that the two nodes of an edge group are 
assigned the same number is 2""^, so by a union bound the probability that we successfully 
pick one node from each group is at least 

1 - m2-™ -^ 1 

because there are less than m edgets. Furthermore, we can check in FO whether there is 
an edge group whose members we can not distinguish, and choose to invariably reject or 
accept in these cases, resulting in an RFO or CO-RFO sentence, respectively. □ 

4.2. BPFO on ordered structures is not contained in MSO 

In the presence of a linear order, any query becomes definable in L^^, and the query TCJ-X 
becomes definable even in FO. However, randomisation adds expressive power to FO also 
on ordered structures: 

Theorem 4.3. There is a class B of ordered structures that is definable in BPFO, but not 
in MSO. 

Remember that monadic second-order logic MSO is the the fragment of second-order 
logic that allows quantification over individual elements and sets of elements. 

Let (Tep< '■= {<,-£', P}, with binary relations < and E, and a unary predicate P. We 
define two classes B' , B oi (T£;p<-structures (cf. Figure l42]l : 
B' is the class of all (T£;p<-structures A for which 

(1) E defines a perfect matching on the set M := P{A) 

(2) the set N := V{A) \ P{A) forms a Boolean algebra with the relation E and 

(3) no X G A^ and y £ M are ii^-related 

(4) < defines a linear order on the whole structure, which puts the M before the A^ and 
orders M in such a way that matched elements are always successive. 

It is easy to see that the class B' is definable in FO. B is the subclass of B' whose elements 
satisfy the additional condition 

2l^^l > |iV|^ (4.1) 

We will prove that B is definable in BPFO, but not in MSO. To prove that B is definable 
in BPFO, we will use the following lemma: 
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Figure 4: The Birthday Paradox with ei 
Pr(/ is injective). 



0.2, £2 = 0.5 and c = 4. Here, p denotes 



Lemma 4.4 (Birthday Paradox). Let m,n> 1 and let F : [n] ^ [m] be a random function 
drawn uniformly from the set of all such functions. 

(1) For any ei > and c > 21n— there is an Hc > 1 such that if n > Uc and m < ^^ we 



have 



(2) For any €2 > 0, if m > ^, then 



Pr(F is injective) < ei 
en 
Pr(i<' is injective) > 1 — £2 



Proof. For the first part, we note that 

n-l 

Pr[F injective) =11 



1 



— ) < n ^^p 



m 



— = exp 

m 



n{n — 1) 
2m 



For the second part, note that 
Pr(F not injective) = Pr 



Fii) = F{j) for alH <i) < 



i<j 



m 



i n 

m ~ 2m 



D 



Proof of Theorem \4.3\ To see that B is not definable in MSO, we use two simple and well- 
known facts about MSO. The first is that for every q > there are natural numbers 
p, m such that for all A; > 0, a plain linear order of length m is indistinguishable from the 
linear order of length m + k ■ p hy MSO-sentences of quantifier rank at most q. The same 
fact also holds for linear orders with a perfect matching on successive elements, because 
such a matching is definable in MSO anyway. The second fact we use is a version of the 
Feferman-Vaught Theorem (cf. |271 Thm. 1.5(ii)]): 

Theorem 4.5. Suppose two r-structures U and V satisfy the same MSO-sentences of 
quantifier rank up to q, and let W he another r-structure. Denote byUUW (resp. V UW ) 
the disjoint union of U (resp. V) and W . Then U L\W and V L\W satisfy the same 
MSO-sentences of quantifier rank up to q. 

The theorem also holds for the ordered disjoint union U< instead of the disjoint union, 
but in our case the elements of the individual structures in the disjoint union are definable 
anyway. If we put these two facts together, we see that for every q > there are p, m 
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such that for ah k, n the structure A £ B with parts M, N of sizes m, n, respectively, is 
indistinguishable from the structure A' with parts of sizes m + k ■ p and n. We can easily 
choose k and n in such a way that A £ B and A' ^ B. 

It remains to prove that B is definable in BPFO. Consider the sentence 



V3inj := \fx\fylx=y V PxV PyV 3z{Pz A -^{Rxz o Ryz)) 

which states that the random binary relation R, considered as a function 

/:A^^Pow(M), x^{y£M\Rxy} 

from N to subsets of M, is injective. By the definition of R, the function / is drawn 
uniformly from the set of all such functions. If we fix |A^|, the probability for / to be 
injective increases monotonically with \M\. Furthermore, for every structure in B', the size 
of N and M are a power of two and an even number, respectively. Thus either 

2l^-^l < - |iV|2 or 2l^'^l>|iV|2, 

and this factor of 4 translates into a probability gap for ipi^y in all sufficiently large structures 
in B' , by Lemma 14.41 with ei = 0.2, 62 = 0.5 and c = 4. The remaining finitely many 
structures in B' can be dealt with separately. D 

4.3. RFO is stronger than FO on additive structures 

Recall that an additive structure is one whose vocabulary contains a ternary relation +, 
such that j4|+ is isomorphic to ([0, \A\ — l],{(a,6, c) \a + b = c}). 

Theorem 4.6. There is a class A of additive structures that is definable in RFO and 
CO-RFO, but not in FO. 

Our proof uses the following result: 

Theorem 4.7 (Lynch [26]). For every A; G N there is an infinite set A^ ^ N and a d^ £¥i 
such that for all finite Qo,Qi C Ak with \Qo\ = \Qi\ or \Qo\ ,\Qi\ > dk the structures 
(N,+,Qo) Oi'^'d (N, +,Qi) satisfy exactly the same FO-sentences of quantifier rank at most 
k. 

Here (N, +,(5j) denotes a { + , P}-structure with ternary + and unary P, where + is 
interpreted as above and P is interpreted by Qi. For a finite set M C N we denote by 
maxM the maximum element of M. By relativising quantifiers to the maximum element 
satisfying P, we immediately get the following corollary: 

Corollary 4.8. Let k, A^, dk, Qo and Qi be as above. Then the (finite) structures 
([0, maxQo], +, Qo) o'^t? ([0, maxQi], +, Qi) satisfy exactly the same FO-sentences of quan- 
tifier rank at most k. 

We call a set Q C N sparse if |Q n {n, . . . , 3n}| < 1 for all n > 0. Note that if Q is sparse 
and finite, then \Q\ < log^{m.ax Q) -\- 1 . It is easy to see that there is an F0[{+, P}]-sentence 

'/'sparse SUch that 

([0, max Q], +, Q) |= (/^sparse <^ Q is sparse 
for all finite Q C N. 
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Proof of Theorem \4.6\ We define the following class of additive {+, P}-structures: 

A = {{[0, max Q],+,Q) \ Q is finite, sparse and \Q\ is even}, 

with + defined as usual. It follows immediately from Corollary 14.81 that A is not definable 
in FO. 

It remains to prove that A is definable in (co-)RFO. We consider a binary random 
relation R on Q = ([0, max Q], +, Q) for some finite Q C N. 

Each element a £ [0,niaxQ] defines a subset of Q, namely the set of 5 G Q for which 
(a, b) £ R{Q) holds. If Q is a sparse set, it has 

2\Q\ < 2i°g3(maxQ)+i <■ vaa.xQ 
~ ~ 21n(max(5) 

many subsets, and by standard estimates on the coupon collector's problem (see, e.g., [29] : 
or use a union-bound argument), if maxQ is large enough, with high probability every 
subset of Q is defined by some element of [0,max(5]. We may check in FO whether this is 
actually the case. If so, we use the random relation R and the linear order induced by + to 
check whether Q is even. Otherwise we reject (accept) to get an RFO- (co-RFO-)sentence. 

D 

5. BPFO is contained in MSO on additive structures 

In this section, we prove our first and only nontrivial derandomisation result. It comple- 
ments the result of Section [4.21 by saying that, on additive structures, every BPFO-sentence 
is equivalent to an MSO-sentence. 

Theorem 5.1. Let t be a finite relational vocabulary containing a ternary relation -\- and 
let ip be a BPFO[t] -sentence. Then there exists an MSO-sentence ip such that on additive 
structures A 

A\= if -^ A 1=11;. 



We first use Nisan's pseudorandom generator for constant depth circuits [32] to reduce 
the number of random bits to log^ ' n; throughout this section, n will denote the size of 
the input structure. We then derandomise the resulting formula following Lautemann's 
argument in [2l]. The second-order quantifier depth of the resulting MSO formula does not 
depend on the input formula ip. 

In MSO[+], one can define a multiplication relation (see [35\ Lemma 5.4]) and thus 
quantify over pairs of elements in [0,y^]. We only need the existence of such a pairing 
function, a slightly weaker form of which is made precise in the following lemma: 

Lemma 5.2 (Pairing Lemma). There are l\ASO[+]-formulas fp{x) and c^/. . .\(x, y, z, w) 
such that on additive structures A 



(^p(x) defines a number p satisfying 



<P< 



2 

Moreover, p is a prime number. 

For every b,c < p there is a unique m such that 99/. . .\(0, 6, c, m) is satisfied. Furthermore, 
for every m there is a unique tuple {a,b,c) G [0,p — 1]^ such that ip(^...^{a,b,c,m) is 
satisfied. Henceforth we write m = {a, b, c) for this. 

16 



Proof. In MSO[+], we may define a formulas ipx={x)iX,x) and (/'divides (^^j 2/) stating that X 
is the set of multiples of x and x divides y, respectively. We may thus check whether x is 
a prime number. Furthermore, we may define the set of powers of a prime number x: It is 
the largest set containing only numbers whose only prime divisor is x. 

Then p is the largest prime number whose set of powers contains at least one element 
other that and itself. Any number m G [0,p^ — 1] may be written as m = bp + c with 
6, c G [0,p — 1]. Both b and c are definable in MSO[+]; notice that b is the largest divisor of 
m — c smaller than p, or if m < p. For m > p'^ we define m = {a, b, c) with a S {1, 2, 3} 
and ?n — ap'^ = (0, 6, c). D 

Whenever we write p in this section, we mean the p defined by the (pp above. The Pairing 
Lemma allows us to quantify over binary relations on [0,p— 1] = Fp. In particular, we may 
define addition and multiplication modulo p, i.e., there are MSO[+]-formulas (p-^-{x,y, z) 
and ipx {x, y, z) such that for a,b,c £ Fp, 

A\= ip+{a,b,c) 44> a + b = c {mod p) 

and 

A\= ipx{a,b,c) 4^ a-b = c (modp). 
For the proof of Theorem 15. II we may assume that the BPFO-sentence ip contains only 
one random relation, say R of arity r. In fact, using the formulas ^Pi-th defining the i-th. 
element of an additive structure (cf. section 12. ip we may pack several random relations 
Ri, . . . ,Rk of arities ri, . . . , r^ into one random relation R of arity r = 1 + max{ri, . . . , r^} 
by replacing every occurrence of RiXi . . . x^ by 

3y(9?i-th(y) Ai? y...y xi...xrj. 

{r—ri) times 

We first apply a result by Nisan |32] to reduce the number of random bits: 



Lemma 5.3. For every r, d G N and e > there are rig G N and MSO[+]-forniulas (pi{x) 
and {ppj.g{S,xi, . . . ,Xr), where S is a set variable, such that in every additive structure A of 
size n > uq, 

• ipi defines a number I < log ^ ' n and 

if (p is an FO[r U {R}]-sentence of quantifier rank < d, where r is some finite relational 
vocabulary and R is of arity r, then 



• 



Pr (X 1= ip) - Pr (A 1= ip'iS)) 

X(iX(A,{R}) SQ[l\ 



<e, 



where ip' is the y\S>0[+]- formula obtained from ip by replacing every occurrence of Rx by 

Vprg{S,x). 

Proof. For any fixed structure A of size n we may construct a polynomial-sized circuit 
Cip^A of depth < d which describes the behaviour of <y9 on (r U {i?})-extensions of A. The 
circuit has n^ inputs indexed by the elements of V{AY , and an input vector x denotes the 
(r U {i?})-extension B^ of A given by 

a £ R{Bg) iff x^ = 1. 

Then C^^^(x) evaluates to 1 iff B^ |= ip. 

Nisan [32] gave a pseudorandom generator for such circuits which hinges on the following 
lemma: 
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Figure 5: Nisan's pseudo-random bit generator. The sets Ai C {1, ...,/} form a partial- 
(logn,m)-design, i.e., they satisfy \Ai\ = m and \Ai D Aj\ < logn for sdl 1 < i ^ 
j < n. 



Lemma 5.4 (restated from \62\ Lemma 2.2]). Let {C„} he a family of circuits of depth d 
and polynomial size, let m = 'm{n) = {YognY^^ , I = l{n) and suppose for each n the sets 

4"\...,^1"^^[/] satisfy 



A 



(n) 



A^r^ n ^(") 



Then 



m for all 1 < i < n and 

< logn for all 1 < i ^ j < n. 

\Fr{Cnix) = 0)-Pr{Cni(B^eA^y^ 



neA, 



.y^) = o)\< 



1 



n^ 



for any c G N and large enough n. Here, the first probability is taken uniformly over all 
strings x E {0, 1}", whereas the second is taken uniformly over all strings y G {0, 1}'. 

The resulting pseudorandom generator is depicted in Figure O Families of sets A^ 
satisfying the above conditions are called partial-{\ogn,m)- designs. Nisan gives a construc- 
tion with I = m? = log^ ' n, which drastically reduces the size of the probability space, 
i.e., the number of random bits needed. We now show how his construction can be defined 
in MSO[+]. 

On [0,p— 1], we may define a formula ip\o^{x, y) which is satisfied iff x = [log2 y] . Using 
this and the fact that 

2[logp] - 1 < [logn] < 2[logpl + 2, 

we let ipm{x) and ^i{x) be two formulas defining natural numbers m and / such that 

• m is a prime number between (r^flogn])'^"'"^ and 2(r^([logn] + 3)*^"*"^ 

• I = nn? 

Using the pairing function f{■^■^ we may assume that R is a, 3r-ary relation which we 
only need to define for elements in Fp. That is, we define ipprg{S, xi, . . . , Xr) by 

32;i • • • 3^3,. Xi = {zi,Z2,Z3) A ... AXr = (23^-2, ^Sr-l, ^Sr) A ip'p,g{S, Zi, . . . , Z3r) 

The formula (/?pj,g(5, z) takes the parity of a subset of S indexed by z: 

(p'pj.g{S, f) := " |5 n ip{A; z)\ is even" , 
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where tp{x,z) is an MSO[+]-formula and tp{A\z) := {x\ A |= ^(x,^)}; evenness may be 
expressed in MSO on ordered structures. By Lemma 15.41 we are done if we can define a 
formula tp{x,z) such that 
(i) V(^;-?) C [/] for allzGF^^ 

(ii) \ip{A; z)\=m for all z £ F^'', and 

(iii) \tP{A; zi) D i/jiA; Z2)\ < logn for all zi / £2 G F^'', 

which means the sets ip{A; z) form a partial-(log n, m)-design. We use the same construction 
as Nisan: We interpret the tuple ^ as a polynomial /^ S Fmi'?] of degree < logn. The set 
ip{A; z) is then the graph of this polynomial, namely 

i;{A; z) = {(C, hm U e F^} C F^, 

and we identify F^ with [/]. We first encode the coefficients of /^ into a set variable X as 
follows: Consider the binary representations 



i>o 



ij2^ with Zij G {0, 1} 



of the Zi. We can define an MSO[+]-sentence V9pack(-?) X) which holds iff X, interpreted as 
a binary relation over Fp, holds exactly for pairs (a, h) with 

< a < [logp] and h= ^ 2i,a2*~^ 

l<i<3r 

Thus for each < a < [logp] there is exactly one b = b[a) with (a, 6) G X, and all 
6s are between and 2^*^, and thus in F^ if n is large enough. We may now define an 
MSO[+]-sentence ipeva\{X,u,v) which, for these Xs, holds iff 

V = fy{u) = Y^ b{a)u'', 

0<a<[logp] 

with addition and multiplication according to F^- Putting these ingredients together, we 
define 

ip{x, z) = 3X3u3v "0 <u,v< m" A ^pe,ck{z, X) A ipcvaiiX, u, w) A "x = n • ?n + v" , 

which is easily verified to satisfy conditions (i) to (iii) above. □ 

So far we have reduced the number of random bits from n*" to / = log^ •* n, and these 
are conveniently packed into the first / bits of a single set variable S. We may now follow 
Lautemann's proof [2lj to derandomise this sentence. 

Proof of Theorem \5.1\ After applying Lemma F5.3l we are left with MSO[+]-sentences ipi and 
if' such that ^pi defines a number I < log ^ ' n and if' has a free set variable S. We may 
assume that for all additive structures A, 

either Pr (A 1= ip'(S)) <\ or Pr (A 1= tf'(S)) > 1 - t, (5-1) 

because otherwise we may use independent repetition and majority vote to obtain these 
bounds. To be precise, let xiS,i,j) be defined by 

x{S,iJ) := (0< i </) A(0<i < l)A3z{z=i-l + j ASz). 

That is, we divide the first P bits of S into / blocks of / bits each, and let x{S,i,j) select 
the i-th bit of the j'-th block. We replace each occurrence of Sx in (/?' by x{S-, i, x) to obtain 
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a formula (p'{S,i). Because / is of order log ' ' n, we may quantify over pairs of elements 
of [0, / — 1], which allows us to express the formula 

^'{S) = "(^'(5, i) holds for at least half of the i G [0, / - 1]" 

in MSO[+], e.g., by stating that there exists a matching M on [0, / — 1] such that 

• if {i,j} G M, then exactly one of (p'{S,i) and (p'{S,j) holds and 

• all i G [0, / — 1] for which (p'{S, i) does not hold are matched by M. 

Then (p' uses P = log ^ ' n many bits of S, and by the Chernoff bound on the tails of the 
binomial distribution it satisfies ()5.ip . even with I replaced by f' (details can be found in 
[21 sec. 7.4]). 

We identify subsets of [/] with vectors in Fg- Let M C Fg be the set of vectors for 
which A \= ip'{S) holds. Equation (j5.ip translates into 



\M\ < 
For a vector y G Fg we define 



|^2| 



or \M\ > ( 1 - -7 



F^ 



y®M:={x®y\x£ M} 
to be the set M translated by y. We claim the following: 
(a) If |M| < IF2I /l, then for every choice of vectors yi, . . . ,yi we have 

IJ {y,(BM)^¥l 



l<i<l 



(b) If |Af I > (1 — l/l) iFg], then there are vectors yi, . . . ,yi such that 

U (y,eM) = F^. 



l<i<l 



The first claim follows immediately from |y © M\ = \M\. For (b), assume that we randomly 
choose the vectors yi independently and uniformly from F2. For any vector x G F2 we have 

Pr (x \J{yi © M)) = Y[ Pr(f y ^ © M) 



< 



by the independence of the yi. But then the expected number of vectors not in lj(yi © M) 



IS 



E 



F'2 \ \J{m © M) 1 = J] Pr (f \Jiyi © M) 



< 



\fI\ 



li 



<1, 



so there must be a choice of y^s such that this number is zero, i.e., [j{yi © M) = ¥2- 

Again using the formula xiS,i,j), we can pack the vectors yi,...,yi into a single 
existentially quantified set variable and check that [J{yi © M) = F2 as follows: 

ip" = 3YyX3iy,'{X(Bx{Y,i,-)), 



20 



where ^'{X © x(^)^) ')) is the formula ^'{S) with every occurrence of Sx replaced by 

{Xx A x{y, i, x)) V i^Xx A ^xiY, i, x)). 
Claims (a) and (b) imply that 

A 1= if" ^ Pt{A 1= ip'{S)) > 1 - y, 
which completes the proof. □ 



6. A logic capturing BPP 

In this section, we prove that the logic BPIFP+C captures the complexity class BPP. Tech- 
nically, the results of this section are closely related to results in |17j . 

Counting logics like FO+C and IFP+C are usually defined via two-sorted structures, 
which are equipped with an initial segment of the natural numbers of appropriate length. 
The expressive power of the resulting logic turns out to be rather robust under changes in 
the exact definition, see |33j for a detailed survey of this. However, we will only need the 
limited counting ability provided by the Rescher quantifier, which goes back to a unary 
majority quantifier defined in [3l], see [33] . 

We let F0(i7) be the logic obtained from first-order logic by adjoining a generalised 
quantifier J, the Rescher quantifier. For any two formulas ^pi{x) and <p2{x), where x is a 
/c-tuple of variables, we form a new formula 

Jx.ipi{x)ip2{x). 

Its semantics is defined by 

A \= Jx.ifi{x)ip2ix) iff 

{a G V{A)'' I A 1= 99i[a]}| < \{a G V{A)'' \ A ^ ip2[a]}\ . (6.1) 

The logic IFP(^7) is defined similarly. 

Lemma 6.1. Let R be a 6-ary relation symbol. There is a formula ip<:{x,y) G F0(i7)[{-R}] 
such that 



lim Pr (Ua,b) I A \= ip<\a,b]\ is a linear order ofV(A)] = 1. 

(Recall that Sn is the ^-structure with universe {1, . . . ,n}. Thus X{Sn, {R}) just denotes 
the set of all {R} -structures with universe {1, . . . , n}.j 

Proof. We let 

ip<{x, y) := Jxi . . . x^.Rxxi . . . X5 Ryxi . . . X5. 

To see that ip< (x, y) defines an order with high probability, let ^ be a structure with universe 
V{A) = {1, . . . , n}. For each a G V{A), let 

Xa:= \{aeV{Af\A^Raa.}\ 

Then A \= (p<{a, b) iff Xa < ^6, and c/9< linearly orders A iff the Xa are pairwise distinct. 
But for a ^ b ^ ^(^)) the random variables Xa and X^, are independent and each is 
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binomially distributed with parameters p = 1/2 and m = n^, and thus 



k=0 



k 22™ '^-^ I A; 



1 -^ /m\ ( m \ _ 1 /2m\ _ p^ / 1 



2'^"' ^^ \k J \m - kj 2'^"^\mJ \\/rn^ 

where the final approximation can be found, for example, in [13]. The second part now 
follows by a union bound over the (g) = 0(m2/^) pairs a ^ b. □ 

Theorem 6.2. The logic BPIFP(J) captures BPP. 

Proof. BPIFP(^7) is contained in BPP, because a randomised polynomial time algorithm 
can interpret the random relations by using its random bits. 

For the other direction, let Q be a Boolean query in BPP. This means that there is a 
randomised polynomial time algorithm M that decides the query Q< of ordered expansions 
of structures in Q. We may view the (polynomially many) random bits used by M as 
part of the input. Then it follows from the Immerman-Vardi Theorem that there is a 
BPIFP-sentence 'i/'M defining Q<. Note that, by the definition of Q<, this sentence is order- 
invariant. We replace every occurrence of < in •0jV'/ by the formula ip<{x,y) of Lemma |6. 11 
which with high probability defines a linear order on the universe. □ 

It is easy to see that BPIFP+C is also contained in BPP and that IFP(J') ^ IFP+C. 
Thus we get the following corollary. 

Corollary 6.3. BPIFP+C = BPIFP(J), and both capture BPP. 

Remark 6.4. Lemma [6. II also implies that BPL'^^{J') = BPC^^, because, in the presence 
of an ordering, a quantifier of the form 3-'^x tp may be spelled out as 



y /\3x {ipi.th{x) A (pix)), 



\S\=n 

where (pi-th{x) defines z-th element in the linear order (cf. section [2?T]) . 

In fact, because the formulas (pi use only three distinct variables independent of i, any 
query is definable in L^^ on ordered structures, as well as on BPC^^. 



7. Summary and Open Problems 

Our main motivation for introducing randomised logics was to apply tools from finite model 
theory to problems in computational complexity theory, and possibly vice versa. Because 
most capturing results from descriptive complexity remain valid when both the logic and 
the complexity class they involve are randomised in the same way, our definitions are indeed 
suitable for this purpose. In particular, the capturing results by Barrington et al. [3] for 
F0[-|-, x] and Behle and Lange [5] for F0[<] and F0[-|-] fall into this category. 

This asks for a more detailed investigation of the expressive power of randomised logics. 
For example, we have shown that BPFO[-|-] can not be derandomised, while conditional de- 
randomisation results for dlogtime- uniform BPAC (cf. |39] ) suggest that BPFO[-|-, x] might 
be derandomisable. As this question seems to elude currect techniques, a first step might be 
to find some relation R for which BPFO[i?] is derandomisable. Note that derandomisability 
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of non-uniform BPAC implies the existence of an infinite sequence {Ri)i>i of relations for 
which BPFO[i?i, R2, . . .] is derandomisable. 

One obstruction to proving results about randomised logics is that, for example, Eh- 
renfeucht-Fraisse games become quite complicated on structures with both a random and 
a non-random part. In [10], the first author proves some non-definability results for BPFO, 
namely that, on vocabularies with only unary relations, BPFO can be derandomised, and 
that the ordering relation < can not be defined in BPFO from its corresponding successor 
relation. A natural next step would be to prove whether BPFO can be derandomised on 
word models or not. 
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